Picture for Muhao Chen

Muhao Chen

University of California Davis

Reinforced Attention Learning

Add code
Feb 04, 2026
Viaarxiv icon

ReasoningBomb: A Stealthy Denial-of-Service Attack by Inducing Pathologically Long Reasoning in Large Reasoning Models

Add code
Jan 29, 2026
Viaarxiv icon

Unbiased Visual Reasoning with Controlled Visual Inputs

Add code
Dec 19, 2025
Viaarxiv icon

FRIEDA: Benchmarking Multi-Step Cartographic Reasoning in Vision-Language Models

Add code
Dec 08, 2025
Viaarxiv icon

Optimizing Diversity and Quality through Base-Aligned Model Collaboration

Add code
Nov 07, 2025
Figure 1 for Optimizing Diversity and Quality through Base-Aligned Model Collaboration
Figure 2 for Optimizing Diversity and Quality through Base-Aligned Model Collaboration
Figure 3 for Optimizing Diversity and Quality through Base-Aligned Model Collaboration
Figure 4 for Optimizing Diversity and Quality through Base-Aligned Model Collaboration
Viaarxiv icon

ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation

Add code
Oct 09, 2025
Figure 1 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Figure 2 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Figure 3 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Figure 4 for ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Viaarxiv icon

False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize

Add code
Sep 04, 2025
Figure 1 for False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
Figure 2 for False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
Figure 3 for False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
Figure 4 for False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
Viaarxiv icon

Code Execution as Grounded Supervision for LLM Reasoning

Add code
Jun 12, 2025
Viaarxiv icon

QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA

Add code
Jun 09, 2025
Figure 1 for QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA
Figure 2 for QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA
Figure 3 for QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA
Figure 4 for QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA
Viaarxiv icon

DiscoSum: Discourse-aware News Summarization

Add code
Jun 07, 2025
Viaarxiv icon